Discovering Regional Co-location Patterns for Sets of Continuous Variables in Spatial Datasets

نویسندگان

  • Christoph F. Eick
  • Rachana Parmar
  • Wei Ding
  • Tomasz Stepinski
  • Jean-Philippe Nicot
  • Tomasz F. Stepinski
چکیده

This paper proposes a novel framework for mining regional co-location patterns with respect to sets of continuous variables in spatial datasets. The goal is to identify regions in which multiple continuous variables with values from the wings of their statistical distribution are co-located. A co-location mining framework is introduced that operates in the continuous domain without and which views regional co-location mining as a clustering problem in which an externally given fitness function has to be maximized. Interestingness of co-location patterns is assessed using products of z-scores of the relevant continuous variables. The proposed framework is evaluated by a domain expert in a case study that analyzes Arsenic contamination in Texas water wells centering on regional co-location patterns. Our approach is able to identify known and unknown regional co-location patterns, and different sets of algorithm parameters lead to the characterization of Arsenic distribution at different scales. Moreover, inconsistent co-location sets are found for regions in South Texas and West Texas that can be clearly attributed to geological differences in the two regions, emphasizing the need for regional co-location mining techniques. Moreover, a novel, prototype-based region discovery algorithm named CLEVER is introduced that uses randomized hill climbing, and searches a variable number of clusters and larger neighborhood sizes. 1 Author is with the Lunar and Planetary Institute, Houston, TX 77058 2 Author is with Bureau of Economic Geology, Jackson School of Geosciences, University of Texas at Austin, Austin, TX 78712 Finding Regional Co-location Patterns for Sets of Continuous Variables in Spatial Datasets Christoph F. Eick, Rachana Parmar, Wei Ding Department of Computer Science University of Houston Houston, TX 77204-3010 Tomasz F. Stepinski Lunar and Planetary Institute Houston, TX 77058 Jean-Philippe Nicot* Bureau of Economic Geology, Jackson School of Geosciences University of Texas at Austin Austin, TX 78712

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Regional Co-location Patterns for Sets of Continuous Variables

This paper proposes a novel framework for mining regional colocation patterns with respect to sets of continuous variables in spatial datasets. The goal is to identify regions in which multiple continuous variables with values from the wings of their statistical distribution are co-located. A co-location mining framework is introduced that operates in the continuous domain without the need for ...

متن کامل

Mining Of Spatial Co-location Pattern from Spatial Datasets

Spatial data mining, or knowledge discovery in spatial database, refers to the extraction of implicit knowledge, spatial relations, or other patterns not explicitly stored in spatial databases. Spatial data mining is the process of discovering interesting characteristics and patterns that may implicitly exist in spatial database. A huge amount of spatial data and newly emerging concept of Spati...

متن کامل

Discovering Co-location Patterns in Datasets with Extended Spatial Objects

Co-location mining is one of the tasks of spatial data mining, which focuses on the detection of the sets of spatial features frequently located in close proximity of each other. Previous work is based on transaction-free apriori-like algorithms. The approach we propose is based on a grid transactionization of geographic space and designed to mine datasets with extended spatial objects. A stati...

متن کامل

Discovering Statistically Significant Co-location Rules in Datasets with Extended Spatial Objects

Co-location rule mining is one of the tasks of spatial data mining, which focuses on the detection of sets of spatial features that show spatial associations. Most previous methods are generally based on transaction-free apriori-like algorithms which are dependent on userdefined thresholds and are designed for boolean data points. Due to the absence of a clear notion of transactions, it is nont...

متن کامل

A multiple window-based co-location pattern mining approach for various types of spatial data

Studies on spatial co-location mining required distance threshold to define spatial neighbourhood (Shashi Shekhar and Yan Huang(2001); Yoo and Shekhar (2004, 2006); Yasuhiko Morimoto(2001); Koperski and Han(1995); Ding et al. (2008)) However, it is problematical for users to choose suitable threshold values because they lack prior knowledge about spatial data. Spatial neighbourhood has been def...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008